Rank in Wordlist | Frequency | Word |
---|---|---|
2977 | 43197 | 1,5 |
4296 | 29063 | 2,5 |
5411 | 22195 | 1,2 |
5771 | 20624 | 3,5 |
6193 | 18951 | 1,3 |
6588 | 17637 | 1,6 |
6730 | 17280 | 1,4 |
7054 | 16261 | 1,8 |
7571 | 14912 | 1,7 |
7680 | 14661 | 1,1 |
Rank in Wordlist | Frequency | Word |
---|---|---|
345895 | 87 | Coop Himmelb(l)au |
371228 | 78 | (T)Raumschiff Surprise |
472126 | 54 | Bis(s) zum Ende der Nacht |
582688 | 39 | Breaking Dawn - Bis(s) zum Ende der Nacht |
675304 | 31 | Bis(s) zum Abendrot |
738095 | 27 | Bis(s) zum Morgengrauen |
842476 | 22 | Eclipse - Bis(s) zum Abendrot |
944741 | 19 | Wa(h)re Liebe |
1022633 | 16 | (I Can't Get No) Satisfaction |
1090185 | 15 | Museum für Energiegeschichte(n) |
Rank in Wordlist | Frequency | Word |
---|---|---|
274934 | 123 | .) |
345895 | 87 | Coop Himmelb(l)au |
371228 | 78 | (T)Raumschiff Surprise |
472126 | 54 | Bis(s) zum Ende der Nacht |
582688 | 39 | Breaking Dawn - Bis(s) zum Ende der Nacht |
656702 | 33 | Sunn O))) |
675304 | 31 | Bis(s) zum Abendrot |
738095 | 27 | Bis(s) zum Morgengrauen |
842476 | 22 | Eclipse - Bis(s) zum Abendrot |
944741 | 19 | Wa(h)re Liebe |
Rank in Wordlist | Frequency | Word |
---|---|---|
16005 | 5989 | 10% |
16122 | 5932 | 100% |
16199 | 5889 | 50% |
17789 | 5242 | 20% |
22126 | 3998 | 30% |
23004 | 3806 | 5% |
25052 | 3417 | 80% |
25423 | 3360 | 40% |
28163 | 2948 | 90% |
29035 | 2836 | 25% |
Rank in Wordlist | Frequency | Word |
---|---|---|
12091 | 8448 | & Co |
15937 | 6018 | S&P |
23010 | 3805 | H&M |
25698 | 3317 | GmbH & Co. KG |
35120 | 2208 | S&P 500 |
36782 | 2074 | Standard & Poor's |
43261 | 1674 | AT&T |
48196 | 1447 | Ernst & Young |
50926 | 1342 | C&A |
65001 | 966 | S&T |
Rank in Wordlist | Frequency | Word |
---|---|---|
432776 | 62 | Ke$ha |
495752 | 50 | A$AP |
591503 | 38 | A$AP Rocky |
1056033 | 16 | Ty Dolla $ign |
1277672 | 12 | U$A |
1986694 | 6 | Mu$eum |
2044341 | 6 | US-$/Unze |
2247830 | 5 | N$/kWh |
2247831 | 5 | N$10 |
2247832 | 5 | N$20 |
Rank in Wordlist | Frequency | Word |
---|---|---|
171 | 762493 | ." |
570056 | 41 | Toys "R" Us |
693474 | 30 | Hochschule für Schauspielkunst "Ernst Busch" |
899475 | 20 | Hochschule für Musik "Hanns Eisler" |
1633302 | 8 | Lee "Scratch" Perry |
1826931 | 7 | Stanley "Tookie" Williams |
1828007 | 7 | Stiftung "Erinnerung, Verantwortung und Zukunft" |
2193371 | 5 | Hochschule für Schauspielkunst "Ernst Busch" Berlin |
2540553 | 4 | Hochschule für Musik "Carl Maria von Weber" |
2540557 | 4 | Hochschule für Musik und Theater "Felix Mendelssohn Bartholdy" Leipzig |
Rank in Wordlist | Frequency | Word |
---|---|---|
6941 | 16632 | gibt's |
8329 | 13195 | geht's |
12489 | 8111 | Let's |
13711 | 7240 | .' |
13804 | 7173 | Let's Dance |
19717 | 4602 | 100'000 |
20635 | 4354 | 10'000 |
21397 | 4171 | Germany's |
21663 | 4105 | gab's |
23033 | 3801 | McDonald's |
Rank in Wordlist | Frequency | Word |
---|---|---|
24059 | 3602 | K+S |
62315 | 1023 | Google + |
79379 | 733 | 50+1-Regel |
87238 | 645 | 90.+1 |
87536 | 642 | 90.+2 |
99203 | 540 | 90.+3 |
99953 | 535 | Gruner + Jahr |
113260 | 448 | 45.+1 |
119253 | 417 | Kühne + Nagel |
123673 | 396 | Kühne+Nagel |
Rank in Wordlist | Frequency | Word |
---|---|---|
370091 | 79 | Sagittarius A* |
441507 | 60 | Get the F*ck out of my House |
1052297 | 16 | Sgr A* |
1660098 | 8 | Sag A* |
6054888 | 1 | Berufsverband Bildender Künstler*innen Berlin |
Rank in Wordlist | Frequency | Word |
---|---|---|
3715 | 34105 | km/h |
5524 | 21576 | dpa/tmn |
8327 | 13198 | Frankfurt/Main |
10480 | 10107 | und/oder |
12847 | 7828 | 2018/19 |
13416 | 7426 | CDU/CSU |
14854 | 6557 | 2017/18 |
16452 | 5779 | https://www |
16899 | 5592 | 90/Die |
17428 | 5391 | Bündnis 90/Die Grünen |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots